Hyperparameter Auto-Tuning in Self-Supervised Robotic Learning

نویسندگان

چکیده

Policy optimization in reinforcement learning requires the selection of numerous hyperparameters across different environments. Fixing them incorrectly may negatively impact performance leading notably to insufficient or redundant learning. Insufficient (due convergence local optima) results under-performing policies whilst wastes time and resources. The effects are further exacerbated when using single solve multi-task problems. Observing that Evidence Lower Bound (ELBO) used Variational Auto-Encoders correlates with diversity image samples, we propose an auto-tuning technique based on ELBO for self-supervised Our approach can auto-tune three hyperparameters: replay buffer size, number policy gradient updates during each epoch, exploration steps epoch. We use a state-of-the-art robot framework (Reinforcement Learning Imagined Goals (RIG) Soft Actor-Critic) as baseline experimental verification. Experiments show our method online yields best at fraction computational Code, video, appendix simulated real-robot experiments be found project page www.JuanRojas.net/autotune.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Auto-tuning PID Controller for Robotic Manipulators

This paper suggests an auto-tuning method of PID trajectory tracking controller for robotic manipulators. In general, the PID trajectory tracking controller for mechanical systems shows the performance limitation. Since the control system including performance limitation can not have equilibrium points, we define newly the quasi-equilibrium region as an alternative for equilibrium point. Also, ...

متن کامل

Hyperparameter Learning for Graph Based Semi-supervised Learning Algorithms

Semi-supervised learning algorithms have been successfully applied in many applications with scarce labeled data, by utilizing the unlabeled data. One important category is graph based semi-supervised learning algorithms, for which the performance depends considerably on the quality of the graph, or its hyperparameters. In this paper, we deal with the less explored problem of learning the graph...

متن کامل

Parameter Auto-tuning Method Based on Self-learning Algorithm

The central air condition system is a complex system. Aimed at the puzzle of optimal status adjusting by once setting parameter of fuzzy PID, the paper proposed a sort of parameter auto-tuning method of fuzzy-PID based on self-learning algorithm. It adopted parameter autotuning technique to adjust the PID parameters in real time so as to ensure good quality of control system. It combined fuzzy ...

متن کامل

Collaborative hyperparameter tuning

Hyperparameter learning has traditionally been a manual task because of the limited number of trials. Today’s computing infrastructures allow bigger evaluation budgets, thus opening the way for algorithmic approaches. Recently, surrogate-based optimization was successfully applied to hyperparameter learning for deep belief networks and to WEKA classifiers. The methods combined brute force compu...

متن کامل

Threshold Auto-Tuning Metric Learning

It has been reported repeatedly that discriminative learning of distance metric boosts the pattern recognition performance. A weak point of ITML-based methods is that the distance threshold for similarity/dissimilarity constraints must be determined manually and it is sensitive to generalization performance, although the ITML-based methods enjoy an advantage that the Bregman projection framewor...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2021

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2021.3064509